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Abstract In this paper we introduce a generalized Sobolev space by defining a semi-inner 
product formulated in terms of a vector distributional operator P consisting of finitely or 
countably many distributional operators P„, which are defined on the dual space of the 
Schwartz space. The types of operators we consider include not only differential operators, 
but also more general distributional operators such as pseudo-differential operators. We de- 
duce that a certain appropriate full-space Green function G with respect to L := P* r P now 
becomes a conditionally positive definite function. In order to support this claim we ensure 
that the distributional adjoint operator P* of P is well-defined in the distributional sense. 
Under sufficient conditions, the native space (reproducing-kernel Hilbert space) associated 
with the Green function G can be isometrically embedded into or even be isometrically 
equivalent to a generalized Sobolev space. As an application, we take linear combinations 
of translates of the Green function with possibly added polynomial terms and construct a 
multivariate minimum-norm interpolant .sy x to data values sampled from an unknown gen- 
eralized Sobolev function / at data sites located in some set X c M, d . We provide several 
examples, such as Matern kernels or Gaussian kernels, that illustrate how many reproducing- 
kernel Hilbert spaces of well-known reproducing kernels are isometrically equivalent to a 
generalized Sobolev space. These examples further illustrate how we can rescale the Sobolev 
spaces by the vector distributional operator P. Introducing the notion of scale as part of the 
definition of a generalized Sobolev space may help us to choose the "best" kernel function 
for kernel-based approximation methods. 
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1 Introduction 

A large and increasing number of recent books and research papers apply radial basis func- 
tions or other kernel-based approximation methods to such fields as scattered data approx- 
imation, statistical or machine learning and the numerical solution of partial differential 
equations, e.g., [2,3,4.5 7 10 14 15 20 21 1. Generally speaking, the fundamental under- 
lying practical problem common to many of these applications can be represented in the 
following way. Given a set of data sites X c K. d and associated values Y c R sampled 
from an unknown function /, we use translates of a kernel function <t> and possible poly- 
nomial terms to set up an interpolant s/jc to approximate the function /. When / belongs 
to the related native space of <t>, we can obtain error bounds and optimality properties of 
this interpolation method. If <P is only conditionally positive definite (instead of the more 
straightforward positive definite case), then it is known that the native space can also be- 
come a reproducing-kernel Hilbert space with a reproducing kernel computed from <P along 
with additional polynomial terms (see Section[3]and [21 1). Nevertheless, there still remain a 
couple of difficult and challenging questions to be answered for kernel methods: What kind 
of functions belong to the related native space of a given kernel function, and which kernel 
function is the best for us to utilize for a particular application? In particular, a better un- 
derstanding of the native space in relation to traditional smoothness spaces (such as Sobolev 
spaces) is highly desirable. The latter question is partially addressed by the use of techniques 
such as cross-validation and maximum likelihood estimation to obtain optimally scaled ker- 
nels for any particular application (see e.g., [19 201). However, at the function space level, 
the question of scale is still in need of a satisfactory answer. As we will illustrate shortly, the 
definition of our generalized Sobolev spaces will include a notion of scale in a rather natural 
way. 

We will deal with these questions in a different way than the authors of the survey 
paper [ 14] did. In this paper, we want to show that the kernel functions and native spaces 
(reproducing kernels and reproducing-kernel Hilbert spaces) can be computed via Green 
functions and generalized Sobolev spaces induced by some vector distributional operators 
P := (Pi, ■ ■ ■ , P„, ■ ■ ■ ) T consisting of finitely or countably many distributional operators P„ 
(see Definition 14.1b . We can further check that differential operators are special cases of 
these distributional operators. 

Some well known examples covered by our theory include the Duchon spaces and 
Beppo-Levi spaces associated with polyharmonic splines (see Examples l5.1l and[ 5.6l l. More- 
over, in 1131 the author expressed a desire to choose the "best" scale parameter of a given 
kernel function for a particular interpolation problem by looking at scaled versions of the 
classical Sobolev space via different scale parameters. Examples 12. 1 1 15 . 3 l and 15 .7 I tell us that 
we can balance the role of different derivatives by selecting appropriate scale parameters 
when reconstructing the classical Sobolev spaces by starting with appropriately chosen in- 
ner products of for our generalized Sobolev spaces. Finally, Example 15.81 shows that the 
native space of the ubiquitous Gaussian function (the reproducing-kernel Hilbert space of 
the Gaussian kernel) is isometrically equivalent to a generalized Sobolev space, which can 
be applied to support vector machines and in the study of motion coherence (see e.g., 1171 
|23]). 

In this article, we use the notation Re(£) to be the collection of all real-valued functions 
of the function space £. For example, Re(C(R rf )) denotes the collection of all real-valued 
continuous functions on R.''. SI is defined as the collection of slowly increasing functions 
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which grow at most like any particular fixed polynomial, i.e., 

SI := {/ : R d -> C : f(x) = OQ\x\\% ) as ||x 2 || -> co for some m e N } . 

(The notation / = 0(g) means that there is a positive number M such that \f\ < M|g|.) 
Roughly speaking, our generalized Sobolev space is a generalization of the classical real- 
valued L2(R rf )-based Sobolev space. The real classical Sobolev space is usually given by 

<H"(R rf ) := {/ 6 Re(L| OC (R d )) n SI : D° f e L 2 (R d ) for all \a\ < n, a e N„] 

with inner product 

Our concept of a real generalized Sobolev space (to be defined in detail in Definition l4.4l 
below) will be of a very similar form, namely 

oo 

H P (R rf ) := {/ g Re(Lf (R d )) n : {Pjf}J =i C L 2 (R rf ) and £ ||iy£<*> < TO } 

7=1 

with the semi-inner product 

oo _ 

(f.*W) : =X I ^/(*W*)dx, /,geH P (R d ). 
1=1 J R " 

Why do we use different vector distributional operators to set up the generalized Sobolev 
space? An important feature driving this definition is the fact that this will give us different 
semi-norms in which to measure the target function / adding a notion of scale on top of the 
usual smoothness properties. As we discuss in Example 12.11 a scale parameter will control 
the semi-norm by affecting the weight of the various derivatives involved. This may guide 
us in finding the kernel function with "optimal" scale parameter to set up a kernel-based 
approximation for a given set of data values — an important problem in practice for which 
no analytical solution exists. 

Since the Dirac delta function <5 at the origin is just a tempered distribution belonging 
to the dual space of the Schwartz space, the Green function G we introduce in Definition l4.3l 
needs to be regarded as a tempered distribution as well. Thus we want to define a distribu- 
tional operator L on the dual space of the Schwartz space so that LG = Sq. The distributional 
operator and its distributional adjoint operator are well-defined in Section HTTI According to 
Theorem 14.11 we can prove that an even Green function G e Re(C(R rf )) n SI is a condi- 
tionally positive definite function of some order m e No- Therefore, we can construct the 
related native space yVg(R d ) of G as a complete semi-inner product space. The native space 
can become a reproducing-kernel Hilbert space and its reproducing kernel is set up by the 
Green function and possible polynomial terms (see Section|3]and [21]). Moreover, the distri- 
butional operator L can be computed by a vector distributional operator P := (Pi, • ■ ■ , P„) T 
and its distributional adjoint P*, i.e., L = P* r P = Y!)=\ Under some sufficient condi- 

tions, we will further obtain a result in Theorem 14 . 2 I that shows that the native space N™(R d ) 
is always a subspace of the generalized Sobolev space Hp(R rf ) and that their semi-inner 
products are the same on Afg(R d ). This implies that the usual native spaces can be iso- 
metrically embedded into our generalized Sobolev spaces. By Lemma 1431 we know that 
H P (R d ) n C(R rf ) n L 2 (R d ) is also a subspace of N™(R d ). Theorems g3] and EH tell us that 
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N'^(R d ) may even be isometrically equivalent to Hp(R d ). However, we provide Example l5.5l 
to show that Afg(R d ) is not always equivalent to Hp(R rf ). In other words, Nq^K 1 ) is some- 
times just a proper subspace of Hp(Rr). We complete the proofs needed for the theoreti- 
cal framework in this article by applying generalized Fourier transform (see Definition 13.21 
and [21 1) and distributional Fourier transform (see Definition [472] and 1 18|) techniques. 



2 Background and Motivation 

Given data sites X = {jci, ■ ■ ■ ,Xn) c R d (which we also identify with the centers of our 
kernel functions below) and sampled values Y = fji, • • ■ c R of a real-valued contin- 
uous function / on X, we wish to approximate this function / by a linear combination of 
translates of a reproducing kernel K. 

To this end we set up the interpolant in the form 

N 

s f>x (x) := ^ CjK(x,xj), x e R d , (2.1) 

7=1 

and require it to satisfy the additional interpolation conditions 

SftX ( X j) = yj, j=l,...,N. (2.2) 

If K is a positive definite [21, Definition 6.24] reproducing kernel then the above system 
d2.2b is equivalent to a uniquely solvable linear system 

A KJ[ c = Y, 

where A K j := x*))^ e R NxN , c := (c u ■ ■ • , c N f and Y := (y u ■ ■ ■ ,y N ) T . Here a 

Hilbert space H K (R d ) of functions / : R d — » R is called a reproducing-kernel Hilbert space 
Il2~n Definition 10.1] with a reproducing kernel K : R d x R d -> R if 

(1) K(-,y) e H K {R d ) and (2) f(y) = (K(-,y),f) KKm , for all / g H K (R d ) and y e R d . 

It is well-known that the interpolant Sfj is the best approximation of an unknown func- 
tion / € H K (R d ) fitting the sample values Y on the data sites X. 

Example 2.1 We consider two reproducing kernels for differently scaled versions of the 
classical Sobolev space "H 2 (R): the kernel 



K s (x,y) := exp 



— \x-y\ 



Mn(i|x->'| + x,ye[ 



and the Sobolev spline (Matern) kernel 



K(x,y) := — (l + a-\x-y\)exp(-cr\x-y\), x,y e R, 

with scale parameter cr > 0. It is not difficult to show that these functions are Green functions 
of the differential operators U := / - -£-y + and L := (cr 2 ! - j^r) 2 , respectively. As a 
result the inner products for their real reproducing-kernel Hilbert spaces are 



(/^)h p ( 



f (f"(x)g"(x) + f'(x)g'{x) + f(x)g(x)) Ax, f,gE<rf{ 
Jr 
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and 

(J,g)n K m-= f (/''(x)i 77 w + 2 £ r 2 /'Wi 7 W + ^ 4 /Wiw)d^ /,geW 2 (i). 

We can also use the theoretical results of Section|43]to show that H^.(R) = , H 2 (R) = H X (R). 
This means that , H 2 (R) and Hjf(R) are isomorphic and indicates that these reproducing- 
kernel Hilbert spaces are isometrically equivalent to generalized Sobolev spaces. More de- 
tails are given in Example 15. 3 1 

This example shows that it may make sense to redefine the classical Sobolev space 
employing different inner products in terms of scale parameters even though -H 2 (R) and 
Ha:(R) are composed of functions with the same smoothness properties and are not distin- 
guished under standard Hilbert space theory (i.e., considered isomorphic). These different 
inner products provide us with a clearer understanding of the important role of the scale 
parameter. This formulation allows us to think of cr" 1 as the natural length scale dependent 
on the weight of various derivatives. The choice of smoothness and scale now tell us which 
kernel to use for a particular application. This choice may be performed by the user based 
on some a priori knowledge of the problem and based directly on the data. 

In the following section we briefly review how to use a conditionally positive definite 
function to construct reproducing kernels. 



3 Conditionally Positive Definite Functions and Native Spaces 

Most of the material presented in this section can be found in the excellent monograph 1211 . 
For the reader's convenience we repeat here what is essential to our discussion later on. 



3.1 Conditionally Positive Definite Functions 

Definition 3.1 CI 21.. Definition 8.1]) A continuous even function <t> : R d — > R is said to 
be a conditionally positive definite function of order m e No if, for all N e N, all pairwise 
distinct centers Xi, . . . ,x N e R d , and all c = (ci, • • • , c N ) T e R N \ |0| satisfying 

N 

for all p e ;r m _i(R rf ), the quadratic form 

N N 

J^J^CjCkQiXj-Xk) > 0. 

7=1 k=l 

In the case m = with n-\{M, d ) := { 0} the function <P is called positive definite. 

In general, we can not hope for a continuous <t> to be Lj (R rf )-integrable so that it has a 
Li(R rf )-Fourier transform. However, <t> always has a generalized Fourier transform. 

Next, we want to have a criterion to decide whether <t> is a conditionally positive definite 
function of order m e No. In Wendland's book [21], the generalized Fourier transform of 
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order m is employed to determine the conditional positive definiteness of <P. Let a special 
test function space S2 m 11211 Definition 8.8] be defined as 

S 2m :={yeS: y(x) = 0(||x||f) as ||*|| 2 -» o} , 

where the Schwartz space S EH Definition 5.17] consists of all functions y e C°°(R' < ) that 
satisfy 

sup \xPlFy{x)\ < C aAY 
for all multi-indices a,p e NjJ with a constant C av g >r . 

Definition 3.2 (121, Definition 8.9]) Suppose that <t> e C(R rf ) n SI. A measurable function 
€ Li, oc (R rf \{0}) is called a generalized Fourier transform of <P if there exists an integer 
m e No such that 

X<P(x)y(x)dx = I <f>(x)y(x)dx, for each y e S2 m . 
Jr'' 

The integer m is called the order of 0. 

Remark 3.1 If <P has a generalized Fourier transform of order m, then it has also order I > m. 
If <f € L 2 (R' / ) n COR 1 *), then its L 2 (R rf )-Fourier transform is a generalized Fourier transform 
of any order. 

Theorem 3.1 (121, Theorem 8.12]) Suppose an even function <P 6 Re(C(R rf )) n SI pos- 
sesses a generalized Fourier transform (f> of order m which is continuous on R. d \ |0). Then 
is conditionally positive definite of order m if and only ifcf> is nonnegative and nonvanishing. 



3.2 Native Space and Reproducing-Kemel Hilbert Space 

If <t> is conditionally positive definite of order m then 1211 Chapter 10.3] shows that <t> can 
be used to create a reproducing kernel and its reproducing-kernel Hilbert space. We firstly 
set up a native space N^,(K. d ) 1211 Definition 10.16]. It is a complete semi-inner product 
space and its null space of N%(R d ) is given by 7r m _i(R d ), i.e., |p|jv» , (R«') = if and only 
if p e 7r m -i(R d ) c N%(R d ). According to (21] Theorem 10.20], N%(R d ) will become a 
reproducing-kernel Hilbert space Hjf (R rf ) with the new inner product 

o 

(/, ghw) ■= if, 8)n^) + 2 /<&)*<&), /, g 6 (R rf ) = W"(R d ), 
t=i 

and its reproducing kernel is given by 

Q Q 
K(x,y) :=0(x - y) - £ ft (x)*<& -j) - £ «G0*(x - fi) 

i=l Z=l 

G e g 

+ 2] X <7i(*)*(j)<P(£t + 9k(.x)q k (y), 

k=\ l=\ k=l 

where [qi,- ■ ■ ,qo) is a Lagrange basis of n m -\(K d ) with respect to a ^■ m _i(R'')-unisolvent 
set {£j , • ■ • , £ e ) c R rf and Q = dim 7r,„_i (R rf ). Moreover the reproducing kernel K is positive 
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definite by [21 , Theorem 12.9]. We can also check that the interpolation j2.1t -l l2~2t by K 
is equivalent to the interpolation by a linear combination of translates of the conditionally 
positive definite function G of order m along with a basis of polynomials /r„,_i(K. rf ) (see [21. 
Chapter 8.5 and Chapter 12.3]), i.e., 

N Q 

s ft x(x) = J] c J ( x - */) + Zjfaqkix), x e R d . 

7=1 k=\ 

Theorem 3.2 (|21„ Theorem 10.21]) Suppose that <t> is a conditionally positive definite 
function of order m 6 No. Further suppose that <P has a generalized Fourier transform (f> of 
order m which is continuous on R \ {0}. Then its native space is characterized by 

N^(R d ) = {/ e Re(C(R d )) n SI : f has a generalized Fourier transform f 
of order m/2 such that 0" 1/2 / e L 2 (R d )} , 

and its semi-inner product satisfies 

(f,8)NS<#) = V*y m f ^ITT^^ f,geN™(R d ). 

4 Green Functions and Generalized Sobolev Space Connected to Conditionally 
Positive Definite Functions and Native Space 

4. 1 Distributional Operators and Distributional Adjoint Operators 

First, we can define a metric p on the Schwartz space S so that it becomes a Frechet space. 
Together with its metric p the Schwartz space S is regarded as the classical test function 
space. 

Let S' be the space of tempered distributions associated with S (the dual space of S, or 
space of continuous linear functionals on S). We introduce the notation 

(T, y) := T(y), for each T e S' and y e S. 

For each / e h[ oc (R d ) n SI there exists a unique tempered distribution Tf e S' such that 

(Tf,~y) = I f(x)y(x)dx, for each y e S. 

So / e h l " c (R d )nSI can be viewed as an element of S' and we rewrite Tf := f. This means 
that L' 1 oc (R rf ) n SI can be isometrically embedded into S' , i.e., \J° c (R d ) n SI Q S' . The 
Dirac delta function (Dirac distribution) 6q concentrated at the origin is also an element of 
S', i.e., (do, y) = y(0) for each y e S. Much more detail of the distributions are discussed in 
Chapter 7.1] and [18 Chapter 1.3]. 

Given a linear operator P : S' — > S', is it always possible to define a linear (adjoint) 
operator P* : S' — > S' which also satisfies the usual adjoint properties? The answer to 
this question is that it may not be possible for all P. However, adjoint operators are well- 
defined for certain special linear operators. We will refer to these special linear operators as 
distributional operators and to their adjoint operators as distributional adjoint operators in 
this article. 



8 



Gregory E. Fasshauer, Qi Ye 



We first introduce these linear operators on S'. Let f* : S — > S be a continuous linear 
operator. Then a linear operator P : S' —> S' induced by V* can be denoted via the form 

(PT, y> := (T, V*y), for each T e S' and y e 5, i.e., P(T) =Top*. 

Furthermore, if P\$ is a continuous operator from S into 5, i.e., {Py : y e *S| c S and 
p(Py„,Py) — » when p(y„,y) — » 0, then we call the linear operator P a distributional 
operator. 

Next we will show that the adjoint operators of these distributional operators are well- 
defined in the following way. In the same manner as before, we can denote another linear 
operator P* : S' — » S' induced by P\s, i.e., 



for each y e S which implies that P*y = Vy. Hence P*\$ = V* on S and P*\$ is a contin- 
uous operator from S into S. Therefore P* is also a distributional operator. This motivates 
us to call P* the distributional adjoint operator of P. According to the above definition, P is 
also the distributional adjoint operator of the distributional operator P*. 

Remark 4.1 In the standard literature [9. Chapter 8.3] P*\s corresponds to the classical ad- 
joint operator of P. Here we can think of the classical adjoint operator P*\$ being extended 
to the distributional adjoint operator P* . Our distributional adjoint operator differs from the 
adjoint operator of a bounded linear operator defined in Hilbert space or Banach space. Our 
operator is defined in the dual space of the Schwartz space and it may not be a bounded 
operator if S' is defined as a metric space. But it is continuous when S' is given the weak- 
star topology as the dual of S. However, since the fundamental idea of our construction is 
similar to the classical ones we also call this an adjoint. 

We now summarize the definitions of the distributional operator and its adjoint operator. 

Definition 4.1 Let P, P* : S' — > S' be two linear operators. If P\s and are continuous 
operators from S into S such that 

(PT, y> = (T, P*y) and (FT, y> = (T, Py), for each T e S' and y 6 S, 

then P and P* are said to be distributional operators and, moreover, P* (or P) is called a 
distributional adjoint operator of P (or P*). 

We will simplify the term distributional adjoint operator to adjoint operator in this arti- 
cle. 

If P = P*, then we call P self-adjoint. A distributional operator P is called translation 
invariant if 

T h Py = Pr h y, for each h e R d and y e S, 
where 77, is defined by r/,y(jc) := y(x-h). A distributional operator is called complex-adjoint 



(P*T, y) := (T, P\ s y) = (T, Py), for each T e S' and y e S. 



Fixing any y e S, we have 




invariant if 



Py = Py, for each y e S. 
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Now we introduce two typical examples of distributional operators. One is the differen- 
tial operator (with constant coefficients) which is a linear combination of the distributional 
derivatives P := D a : S' — » S' . The distributional derivative is extended by the (strong) 
derivative 

for the formula 

(D"T,y) := (-\) M (T,D"y), for each T eS' andye S, 

(see 1 1 , Chapter 1.5]). It is easy to check that the distributional derivative is a distributional 
operator. So we can determine that the differential operator is a distributional operator, i.e., 

P := c cjy, P* ■= y (-l) M c a D a , where c a e C and a e Nq, n e N . 

The other kind of distributional operator is defined for any fixed function 

peTT :={fe C°(R d ) : D°7 e *SJ for each a e N^} . 

It is obvious that all complex-valued polynomials belong to !F7~. Since py e S for each 
y € S, we can verify that the linear operator yn py is a continuous operator from S into 
S. Thus this distributional operator P related to p is denoted as 

(PT, y) := (T, py), for each T e S' and ye5. 

We can further check that this operator is self-adjoint and Pg = pg e L l ° c (R d ) n >SJ if 
g e L' oc (R d ) n >SJ. Therefore we use the notation P := p for convenience. The TT space is 
also applied in the definition of distributional Fourier transforms of distributional operators 
in Section POl 



4.2 Distributional Fourier Transforms 

We denote y e S and y 6 S to be the Li(R d )-Fourier transform and inverse Li(R d )-Fourier 
transform (unitary and using angular frequency) of the test function y e S as in 1211 Defini- 
tion 5.15]. 

Following the theoretical results of [9 Chapter 7. 1] and [18, Chapter 1 .3] we can define 
the distributional Fourier transform T € S' of the tempered distribution T e S' by 

(T,y):=(T,y), for each y e <S. 

The fact (T,y) = (7\ y) implies that the Li(R. rf )-Fourier transform of y 6 S is the same 
as its distributional transform. If / e L,2(R d ), then its L^K.'O-Fourier transform is equal 
to its distributional Fourier transform. The distributional Fourier transform <5o of the Dirac 
delta function Sq is equal to {27t)~ d l 2 . Moreover, we can check that the distributional Fourier 
transform map is an isomorphism of the topological vector space S' onto itself. This shows 
that the distributional Fourier transform map is also a distributional operator. 

If <t> 6 C(R d ) n SI has the generalized Fourier transform of order m, then its gen- 
eralized Fourier transform and its distributional Fourier transform coincide on the set Stm, 
i.e., 

(0,y) = (<P, y) = I <P(x)y(x)dx = I cf>(x)y(x)dx , for each y e S 2m . 
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Even if <P does not have any generalized Fourier transform, it always has a distributional 
Fourier transform <t> since can be seen as a tempered distribution. 

Our main goal in this subsection is to define the distributional Fourier transform of a 
distributional operator induced by the TT space introduced in Section l4~Tl 

Definition 4.2 Let P be a distributional operator. If there is a function p e TT such that 

(PJ, y) = (pf, y) = (f, py), for each T e S' and y e S, 
then p is said to be a distributional Fourier transform of P. 

If P has the distributional Fourier transform p, then P is translation-invariant because Ti,Py(x) = 
e -\x hp(x)y(x) = Pr h y(x) for each h e R d and y e S. Moreover, if P is complex-adjoint 
invariant and has the distributional Fourier transform p, then 

(pf,y) = (t,Jf) = (t,Pf) = (T,Pf) = (T,Pf) = (P*T,f) = (Ff,y) 

for each T e S' and y e S. This shows that p is the distributional Fourier transform of the 
adjoint operator P* of P. 

Because of D a y = (py) for each y e S, we can show that any distributional derivative 

D a has the distributional Fourier transform p(x) := (zjc)* where i = y—1. This also im- 
plies that the distributional Fourier transform p* of its adjoint operator (-l)'"'D ff is equal to 
p*(x) = (-ix) a = p(x). Furthermore, we can also obtain the distributional Fourier transform 
of a differential operator in the same way, e.g., 

p(x) = ^ c a (ix) a , where P = ^ c a D a , c a e C and a e N„, n e N () . 



4.3 Green Functions and Generalized Sobolev Space 

Definition 4.3 G is the (full-space) Green function with respect to the distributional opera- 
tor LifGeS' satisfies the equation 

LG = 6 . (4.1) 

Equation 04.1b is to be interpreted in the sense of distributions which means that (G, L*y) = 
(LG, y) = (do, y) = y(0) for each y 6 S. 

According to Theorem l3.1l and [12] we can obtain the following theorem. 

Theorem 4.1 Let L be a distributional operator with distributional Fourier transform I. 
Suppose that I is positive on R rf \|0). Further suppose that l~ x E SI and that l{x) = 0(||x||^ m ) 
as \\x\\ 2 — » Ofor some m 6 No. If the Green function G € Re(C(R rf )) n SI with respect to L 
is an even function, then G is a conditionally positive definite function of order m and 

Q m (x) := {2ny dl2 i(x)-\ xeW*, 

is the generalized Fourier transform of order m of G. (Here the notation f = 0(g) means 
that there are two positive numbers M\ and M 2 such that M\ \g\ < |/| < M 2 \g\.) 
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Proof First we want to prove that g„, is the generalized Fourier transform of order m of G. 
Since Z _1 e SI and l(x) = 0(\\x\\%") as ||jc|| 2 — > for some m e N u , the product g m y is 
integrable for each y e »S 2m . Let G be the distributional Fourier transform of G. If we can 
verify that 



(G,y) = g,„(x)y(x)dx, for each y e 5 2m , 
J*' 

then we are able to conclude that g,„ is the generalized Fourier transform of G. 

Since / is the distributional Fourier transform of the distributional operator L we know 
that I e TT. Thus D a (r 1 ) e SI for each a e because of D"f e 5 J and r 1 e *SJ. If 
/(0) > 0, then Z" 1 e !F7~, which implies that l~ x y e 5 for each fixed y e >S 2m . Hence 

<G,y> = (JG,r l y) = (LG,r l y) = (SoJ^y) = {(2n)- dl \ T'y) 
(2;r)-'' /2 /(j:)" 1 y(*)dj: = f Q m (x)y{x)dx. 



J 



If Z(0) = 0, then / 1 does not belong to TT. However, since / e TT is positive on 
l d \ {0} we can find a positive- valued sequence {/„)" =1 c C°°(IR'') such that 



l„(x) 



l(x), \\x\\ 2 >n-\ 
t(x) + n-\ ||*|| 2 <n- 2 . 



In particular l\ = 1. And then { l n )„-\ c TT. It further follows that D a l„ converges uniformly 
to D a l on R d for each a e N[J. 

We now fix an arbitrary y e vS 2m . Since /,7'y and l~ x y have absolutely finite inte- 
gral, l~ l y converges to l~ x y in the integral sense. Let y„ := l~ l y. We can also check that 
(ty„Y converges to y point wisely which indicates that G(x) (ty„^\x)dx converges to 
Lj G(x)y(x)dx. Thus we have 

(y,G) s = \im(ty n ,G)s = lim(y„,LG) s = lim<y n ,5 ) 5 = lim(y„, (2n)- d l 2 ) s 

n—>co n — »oo n— *co /j — »oo 

= lim f (2n)- dll l n (xY x y(x)Ax = \ (2n)- d l 2 !(x)- l y(x)&x = \ G m (x)y(x)dx. 

Since g m e C(R d \{0}) is positive on R d \{0} and G e Re(C(R d ))nSI is an even function, 
we can use Theorem 13.11 to conclude that G is a conditionally positive definite function of 
order in. □ 

Remark 4.2 If L is a differential operator, then its distributional Fourier transform / satisfies 
the conditions of Theorem 14. II if and only if / has a polynomial of the form l(x) := q(x) + 
aim IWI 2 ™, where a 2 „, > and q is a polynomial of degree greater than 2m so that it is 
positive on R d \ |0), or q = 0. 

Now we can define the generalized Sobolev space induced by a vector distributional 
operator P = (Pi, • • ■ , P„, ■ ■ ■ ) T similar as in 11221 Definition 6]. 

Definition 4.4 Consider the vector distributional operator P = (Pi, ■ • • , P„, ■ ■ • ) T consist- 
ing of countably many distributional operators {Pj}J =l . The real generalized Sobolev space 
induced by P is defined by 



n P (R d ) := {/ g Re(Lf c (R d )) n SI : {Pjf}J =1 C L 2 (R d ) and £ < °° 
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and it is equipped with the semi-inner product 



(/> g)H P (R' i ) : 




Pjf{x)P jg (x)dx, /,geH P (R d ) 



For example, if we let P.- := D a for each a e Nq and \a\ < n and the others be zero 
operators, then the classical L,2-based Sobolev space "H"(R. rf ) = Hp(R rf ) is a special case 
of the generalized Sobolev space. If we choose the vector distributional operator P as in 
Example 15.31 then Hp(R rf ) and *H" '(Br) are isomorphic to each other which indicates that 
we redefine the Sobolev space for different inner products using the scale parameter cr > 0. 
Generalized Sobolev spaces can also become different kinds of Beppo-Levi spaces with cor- 
responding semi-inner products (see Example 15.6) . The reproducing-kernel Hilbert space of 
the Gaussian kernel will be isometrically equivalent to a generalized Sobolev space Hp(R. d ) 
as well as explained in Example 1 5. 8 1 

Now we discuss the relationship between the generalized Sobolev space and the native 
space. In the following theorems of this section we only consider P constructed by a finite 
number of distributional operators P\,...,P n which means that Pi := when j > n. If 
P := (Pi, • • • , P„) T , then the distributional operator 



is well-defined, where P* := (P*, • • ■ , P*) r is the adjoint operator of P as defined in Sec- 
tion |4T| If we suppose that P is complex-adjoint invariant with distributional Fourier trans- 
form p = (pi, ■ ■ ■ ,p„) T , then the distributional Fourier transform p* = (p* • • • ,p*) T of its 
adjoint operator P* is equal to p = (p 1 , ■ ■ ■ ,p„) T . Since 



Moreover, since P has a distributional Fourier transform, P is translation invariant (see Sec- 
tk>n@3J. 

We are now ready to state and prove our main theorem about the generalized Sobolev 
space Hp(R c ') induced by a vector distributional operator P := (Pi, ■ • • , P n ) T . 

Theorem 4.2 Let P := (Pi, ■ ■ ■ , P„) r be a complex-adjoint invariant vector distributional 
operator with vector distributional Fourier transform p := (pi, ■ ■ ■ ,p„) T which is nonzero 
on R d \ {0}. Further suppose that x h-> HpX*)!^ 1 € SI and that ||p(jc)|| 2 = ^(IMI™ ) as 
1Mb ~ * far some m 6 No. If the Green function G € Re(C(R. rf )) n SI with respect to 
L = p* r p i s chosen so that it is an even function, then G is a conditionally positive definite 
function of order m and its native space A/J?(R d ) is a subspace of the generalized Sobolev 
space Hp(R d ). Moreover, their semi-inner products are the same on N'£(R. d ), i.e., 




{P)PJT, 7 ) = (p)PjT,y) = (pjt.p'y) = (PjPjty) = (\ Pj \ 2 ty) 



for each T e S' and y e S, the distributional Fourier transform / of L is given by 



l(x) := ^ \pj(xf = \\P(x)\\l , xeR d . 



Title Suppressed Due to Excessive Length 



13 



Proof By our earlier discussion the distributional Fourier transform / of L is equal to l(x) = 
llpOOIIj. Thus I is positive on R d \ {0|, t 1 e SI and l(x) = 0(I|x||J") as [|x|| 2 -> 0. According 
to Theorem l4.ll G is a conditionally positive definite function of order m and its generalized 
Fourier transform of order m is given by 

g, n (x) := {27TT d/2 i(x)- ] = {2nT m Ilp(*)ll 2 2 , xeR d . 

With the material developed thus far we are able construct its native space N£(R d ) (see 
Sectionll2li. 

Next, we fix any / 6 N'£(R d ). According to Theorem|3]2] / e Rs(C(R d ))C\SI possesses 
a generalized Fourier transform / of order m/2 and x i-» /(jc) ||p(jc)|| 2 € h2(R d ). This means 
that the functions pjf belong to L 2 (R d ), j = 1, . . . ,n. Hence we can define the function 

f P . € L 2 (R rf ) by 

:= (PjfJt L 2 (R << ), ;= 1 « 

using the inverse L 2 (R d )-Fourier transform. 

Since ||p(*)|| 2 = 6>(IMI 2 ) as ||x|| 2 -> we have pj(x) = 0(||x|| 2 ) as ||x|| 2 -> for each 

y = 1, ... Thus /3jy 6 S ln for each y 6 S. Moreover, since pjy = pff = p*y = P*y and 
the generalized and distributional Fourier transforms of / coincide on S„, we have 

f f Pi (x)j7F)dx = f (PjfMy&)te= f (Pjf)(x)y(x)dx 

Jr' Jr<' Jr>' 

=</, ftr> = </, ?*y> = </, ?jy> = </, ^r> = (Pjf, f>, r e 5. 

This shows that P,/ = f P . e L 2 (R d ). Therefore we know that / 6 H P (R tf ). 
To establish equality of the semi-inner products we let /, g e (R rf ). Then the Plancherel 
theorem 1181 yields 



(/,£)hp(r") = ^ J^ ( /^-(x)g/»;(x)dJc = 2_j J^(Pjf)(x){Pjg)(x)dx 

= f f(x)W)\\P(x)\\ 2 2 dx= f /(x)|W/(x)dx 

Jr-' Jr-' 

, x-d/2 r /(x)^(x) . , 

= (27r) n d - t = (/'g)^)- 

Jr" Qm(x) 



Remark 4.3 If each element of P is just a differential operator then all their coefficients are 
real numbers because P is complex-adjoint invariant. 

The preceding theorem shows that N'£(R d ) can be isometrically embedded into Hp(R d ). 
Ideally, N'£(R d ) would be equal to Hp(R rf ), but this is not true in general. However, if we 
impose some additional conditions on H P (R ), then we can obtain equality. 

Definition 4.5 Let P := (P\ , • • • , P n ) T be a vector distributional operator. We say that the 
generalized Sobolev space Hp(R rf ) possesses the S-dense property if for every / 6 Hp(R d ), 
every compact subset A c R d and every e > 0, there exists y e S Ci Hp(R rf ) such that 

1/ " ylH P (R<<) < e and 11/ - rlL(A) < e - (4-2) 
i.e., there is a sequence {y„}™ =1 Q S fl Hp(R d ) so that 

1/ - 7«Ih p (r") -> and \\f - y„|| LooM) -> 0, when n -> oo. 
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Following the method of the proofs of 11211 Theorems 10.41 and 10.43], we can complete 
the proofs of the following lemma and theorem. 



Lemma 4.3 Let P and G satisfy the conditions of Theorem \4.2\ and suppose that Hp(R rf ) 
has the S-dense property. Assume we are given the data sites [x\, ■ ■ ■ , X^} C R. and scalars 
■ ■ ■ , An] C R. If we define fx '.= Ztli ^-kG(- — x k ), then for every f e 1 1|>( ') and every 
X 6 R we have the representation 

N 

(Mx - •), AW) = 2 A*f(* ~ **)• (4-3) 

Proof Let us first assume that yeSn H P (R rf ). According to Theorem 1431 6 N'£(R d ) C 
H P (R ), Since P is translation invariant and complex-adjoint invariant we have 

n p n p 

(f A (x - ■),y) Hp ( R ") = E Pj, y fAx-y)P~W)dy = V -y)P/y(y)&y 



n p N r 

= Y(Mx--),P*Pj 7 )= fA(y)L y y(x-y)dy = Y A k G(y - x k )L y y(x - y)dy 

www 
= 2] /U(LG, y(x - xjt - •)> = ^ 4fc<<?0, r(x - x t - •)> = J] ~ **)• 



«r=l Jt= 1 A-1 

For a general / e H P (R d ) we fix j: € R rf and choose a compact set A c l rf such that 
x - x k € /l for A: = 1, . . . , N. For any e > 0, there is a 7 6 5 (1 Hp(R < ') which satisfies Equa- 
tion I l4.2t . Then two applications of the triangle inequality show that the absolute value of 
the difference in the two sides of Equation d4.3t can be bounded by e(Zf =1 \A k \ + I/iIh p (R''))' 
which tends to zero as e — > 0. □ 



Theorem 4.4 Let P and G satisfy the conditions of Theorem \4.2\ 7f Hp(R ) possesses the 
S-dense property, then 

N%(R d ) = '). 

Proof By Theorem l4.2l we already know that N™ (R rf ) is contained in Hp(R rf ) and that their 
semi-inner products are the same in the subspace (R rf ). Moreover, N 1 ^ (R rf ) is a complete 
subspace of Hp(R. rf ). So, if we assume that JV™(R rf ) were not the whole space Hp(R rf ), then 
there would be an element / 6 Hp(R </ ) which is orthogonal to the native space Afg(R ). 

Let Q = dim/r„,_i(R' / ) and {q\, ■ ■ ■ , qo] be a Lagrange basis of n m -i(R d ) with respect 
to a 7r m _i(R d )-unisolvent subset {£ l5 • ■ • ,£q\ c R d . We make the special choice of the data 
sites {— x, ■ ■ ■ and scalars {1,-i7i(jc), ■ ■ ■ , — qQ(x)} and correspondingly define 



fi :=G(-+x)-Y j q k (x)G(-+t k ). 



Since Hp(R rf ) has the .S-dense property we can use Lemma l4~3l to represent any / e Hp(R rf ) 
in the form 

Q 

f(w + x) = 2] qk(x)f{w + &) + (ftOe - ■), /)h p (e")- 



Title Suppressed Due to Excessive Length 



15 



Since G is even, we have x t-> f\(-x) e N'£(R d ). We now set w = 0. The fact that / is 
orthogonal to Nq (R d ) gives us 

Q Q 

fix) = Yj + (fii-l / WO = Z f^k(x). 

k=l k=\ 

This shows that / e ^ m _i(R rf ) c Ng{R d ), and it contradicts our first assumption. It follows 
that N'"(R d ) = H P (R d ). □ 



Lemma 4.5 Let P ant/ G satisfy the conditions of Theorem \4.2\ Then 

H P (R d ) n C(R d ) n L 2 (R d ) c N%(R d X 

Proof We fix any / 6 H P (R d ) n C(R d ) n L 2 (R rf ) and suppose that / and P~f, respectively, 
are the L^R^-Fourier transforms of / and Pjf, j = 1, . . . ,«. Using the Plancherel theorem 
fT8l we obtain 

f (pjf)(x)(pjfj(x)dx = f P~f(x)P~f(x)dx = f Pjf(x)P]f(xjdx<™. 

JW JR'I Jr** 



And therefore, with the help of the proof of Theorem 14. 2 1 we have 

|2 



f L^Lcb: = (2^ f |/(x)| 2 l(x)dx = (Inf 2 f \f(xtf \\p(x)\\l dx 

JR'I 9mW JR'I JRd 

= {2nf 2 y f \f{x)pj(x)\ 2 dx<<x, 



showing that g„, / e L 2 (R ), where g„, is the generalized Fourier transform of G. And 



now, according to Theorem [37T] / e N™(R d ). □ 



This says that H P (R rf ) n C(R d ) n L 2 (R d ) can be isometrically embedded into N'£(R d ). 
Moreover, according to Lemma l4~5l we can immediately obtain the following theorem. 

Theorem 4.6 Let P and G satisfy the conditions of Theorem\4Jl IfH v (R d ) c L 2 (R d ), then 
G is positive definite and 



Nc(R d ) = H P (R d ). 



(It also indicates that m = 0.) 



Proof Since G e N™(R d ) Q tt v (R d ) c L 2 (R rf ), its generalized Fourier transform of any 
order is equal to its L^R^-Fourier transform which implies that g„, e L 2 (R rf ) n L^R^). So 



x ||p(x)||2 6 L 2 (R d ) and ||p(*)|| 2 = 0(1) as ||x|| 2 -> 0. According to Theorem 1421 G is a 
positive definite function. 

We fix any / e Hp(R rf ) C L 2 (R d ). According to the proof of Lemma 1431 we have its 
distributional Fourier transform / e L 2 (R d ) and 



yf \pjf(x)\ 2 dx = y f \pj(x)f(xfdx= f \\p(x)\\i\f(x)\ 2 dx. 

j-f JR' 1 1 1 j~f JR d JR'' 

This means in particular that / e Li (R d ) because 

f |/(*)|d*<(f \\P{x)\\ 2 2 \f(x)\ 2 )' (f \\p(x)\\- 2 
Jr* \Jri ) \Jw 
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Thus, the inverse Li (R rf )-Fourier transform of / is equal to the inverse L2(R rf )-Fourier trans- 
form of / which can be identified with /. This implies that / e C(R rf ). According to Theo- 
rem[0]and Lemma|43] we have N™(R d ) = H P (R d ). □ 

Remark 4.4 As Example 15. 5 l in Section l5T2l shows. the native space N^(R d ) will not always 
be equal to the corresponding generalized Sobolev space Hp(R rf ). 



5 Examples of Green Functions and their Related Generalized Sobolev Spaces 

5.1 One-Dimensional Cases 

Example 5.1 (Cubic Splines) Consider the (scalar) distributional operator P := d 2 /dx 2 and 
L := P* r P = d 4 /d;c 4 . By integrating Equation 04. Il l four times we can obtain a family of 
possible Green functions with respect to L, i.e., 

G(x) := -j-^- + a^x 3 + aix 2 + a\x + ao, iel, 

where aj € R, j = 0, 1,2, 3. However, we want the Green function to be an even function. 
Hence, we choose 

G(x) := — \x\ 3 , xeR. 
12 

This ensures that P and G satisfy the conditions of Theorem 14.21 and ||p(x)|| 2 = \x\ 2 . As a 
result, the associated interpolant is given by 

N 

Sfxix) :=Yj-\x-Xj\ + /3 2 x +p u xeR. 
H 

This is the same as the cubic spline interpolant (see |2, Chapter 6.1.5]). 

According to 11211 Theorem 10.40], we can check that Hp(R) has the .S-dense property. 
Therefore, Theorem I4.4l tells us that JV^(R) = Hp(R) and it follows that the cubic spline is 
the optimal interpolant for all functions in the generalized Sobolev space Hp(R). 

Example 5.2 (Tension Splines) Let cr > be a tension parameter and consider the vector 
distributional operator P := (d 2 /djc 2 ,crd/dx) r and L := P* r P = d 4 /dx 4 - cr 2 d 2 /dx 2 . Then 

G(x) := (exp(-cr \x\) + cr \x\) , xeR, 

is a solution of Equation fl4. lb . We can verify that P and G satisfy the conditions of Theo- 
rem g3] and that ||p(jc)|| 2 = (|jc| 4 + cr 2 \x\ 2 ) 1/2 = 0(\x\) as \x\ -» 0. So G is a conditionally 
positive definite function of order 1. This yields the same interpolant as the tension spline 
interpolant (4 1161 . 

According to the Sobolev inequality |1| and [21 Theorem 10.40], Hp(R) has the S- 
dense property which implies that Nq(R) = Hp(R). Theorem 14.41 and 1211 Theorem 13.2] 
provide us with the same optimality property as stated in B1 I161 . 
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Example 5.3 (Examvle \2.1\ 

We use the theoretical results of Section l4~3l to verify the reproducing-kernel properties 
of Example 12. II Here we only give details for the Sobolev spline kernel as the other kernel 
can be treated in the same way. Let P := (d 2 /dx 2 , V2crd/d;t, cr 2 I) T and L := P* r P = (cr 2 I - 
d 2 /djc 2 ) 2 . It is known that the Green function with respect to L is the Matern function 

G(x) := — t(1 +cr\x\)exp(-cr\x\), xeR. 

Since P and G satisfy the conditions of Theorem 14.21 and ||p(jc)|| 2 = a 2 + x 2 = 0(1) as 
\x\ — > 0, we can determine that G is positive definite. It is easy to check that Hp(R) = 7-{ 2 (W). 
Since we have "H 2 (R) C L 2 (R), we can use Theorem @3] to check that N%(K) = H P (E). As 
discussed in Section [3T2l the reproducing kernel and its reproducing-kernel Hilbert space 
have the forms K(x,y) = G(x - y) and tt K (W) = H P (R). 



5.2 Two-Dimensional Cases 

Example 5.4 (Thin Plate Splines) Let P := (d 2 /dx 2 , ^ld 2 ldx x dx 1 , d 2 /dx 2 ) T so that L := 
P* r P = A 2 . It is well-known that the fundamental solution of the Poisson equation on R 2 is 
given by x i — > log ||jc|| 2 , i.e., A log \\x\\ 2 = -2nd. Therefore Equation d4.lt is solved by 

G(x):= -!-|W| 2 log|W| 2 , *eR 2 . (5.1) 

Since P and G satisfy the conditions of Theorem l4.2l and ||p(jc)|| 2 = ||x|| 2 , G is a conditionally 
positive definite function of order 2 and its related interpolant has the form 

N 

s fiX (x):=Y J CjG(x-Xj)+/3 3 X2+p 2 x\+Pu x = (x u x 2 ) e R 2 . (5.2) 

7=1 

Moreover, according to 0211 Theorem 10.40], we can verify that Hp(R 2 ) has the .S-dense 
property. Therefore, yVg(R 2 ) = Hp(R 2 ) by Theorem l4.4l Equation d5.2t is known as the thin 
plate spline interpolant (see [3 6 10)). 

Finally, we consider the Duchon semi-norm mentioned in [6|, i.e., 

2 

d*, / e Lf f (R 2 ) n SI, 

and the Duchon semi-norm space 

•H fl2 (R 2 ) := {/ 6 Re(Lf (R 2 )) n SI : \f\ Di < oo) . 

If we define P as above, then it is easy to check that Hp(R 2 ) = , H D2 (R 2 ). According to 1211 
Theorems 13.1 and 13.2] we can conclude that the Duchon semi-norm space possesses the 
same optimality properties as those listed in (6). 

The following example shows that the same Green function G can generate different 
generalized Sobolev spaces Hp(R rf ). Moreover, it illustrates the fact that the native space 
yV™(R rf ) may be a proper subspace of Hp(R d ). 



\f\l 2 ■= 



f 


d 2 f{x) 


Z 

+ 2 


d 2 f{x) 


z 


d 2 f(x) 


J'A 2 


dx 2 




dx\dx2 


+ 


dx 2 
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Example 5.5 (Modified Thin Plate Splines) Let P := A and L := P* r P = A 2 . We find that 
the thin plate spline &5A[ is also the Green function with respect to the operator L defined 
here. The associated interpolant is again of the form (15.2) . 
We now consider the Laplacian semi-norm 

|/g:= f kf/(x)| 2 dr, / £ L' oc (R 2 ) n SI , 
Jr 2 

and the Laplacian semi-norm space 

7^(R 2 ) := {/ 6 Re(Lf (R 2 )) n SI : \f\ A < ™] . 

It is easy to verify that Hp(R 2 ) = < Ha(M 2 )- However, it is known that 'H £ > 2 (R 2 ) is a proper 
subspace of ^(R 2 ) since q e ^(R 2 ) but q i where q(x) := x\X2- Therefore, due to 
Example l5.4l we conclude that 

Af 2 (R 2 ) = •Ho.CR 2 ) S ^(R 2 ) = H P (R 2 ). 

Instead of working with the polynomial space /^(R 2 ) which is used to define A/2(R 2 ), we 
can construct a new native space A/^f (R 2 ) for G by using another finite-dimensional space 
@> of Re(C 2 (R 2 )) n SI such that Nf(R 2 ) may be equal to the other subspace of H P (R 2 ). 
First we can verify that the finite-dimensional space £P := span {tti(R 2 ) U is a subspace 
of the null space of H P (R 2 ). Since tti(R 2 ) c 2? and G is a conditionally positive definite 
function of order 2, we know that G is also conditionally positive definite with respect to 
& 1 . Hence, the new native space Nq (R 2 ) with respect to G and 3* is well-defined (see 1211 
Chapter 10.3]). We can further check that Af^f (R 2 ) is a subspace of Hp(R 2 ) but it is larger 
than A/£(R 2 ), i.e., JV 2 (R 2 ) £ Nf(R 2 ) Q H P (R 2 ). 

So we can obtain a modification of the thin plate spline interpolant based on 

N 

sf x (x) := 2^ CjG(x - Xj) + P$xiX2 + P3X2 + P2X1 x = (x\,X2) e R 2 . 

./=! 

Conjecture 5.1 Motivated by Example 15.51 we audaciously guess the following extension 
of the theorems in Section \4.3\ Let P and G satisfy the conditions of Theorem \4.2\ If the 
subspace 9 s of the null space o/Hp(R d ) is a finite-dimensional subspace and n m -\ (R d ) Q 2P, 
then the new native space AQ?(R 2 ) with respect to G and 3? is a subspace o/Hp(R d ). 



5.3 d-Dimensional Cases 

Example 5.6 (Polyharmonic Splines) This is a generalization of the earlier Examples 15.11 
andET4l Let P := (d^/dx?, • • ■ , (m!/a!) 1/2 IT, ■ ■ ■ , o m /dx%) T consisting of all (m!/a!) 1/2 D a 
with \a\ = m > d/2. We further denote L := P* r P = (-l) m A m . Then the polyharmonic spline 
on R d is the solution of Equation d4.1b (see [i2, Chapter 6.1.5]), i.e., 



G(x) 



r(d/2-m) 



•s. ,..;,„, h . ii-' 11. for d odd, 

log ||jc|| 2 for d even. 



(-D'"'' /2 ~' \\ r \\ 2 '"-d 
2 2 '»-'^«-(m-l)!(m-rf/2)! I|A|I 2 



We can also check that P and G satisfy the conditions of Theorem 14.21 and that ||$(x)|| 2 = 
\\x\\™ ■ Therefore G is a conditionally positive definite function of order m. Furthermore, 
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according to 1211 Theorem 10.40], we can verify that Hp(R d ) has the >S-dense property. 



Therefore, N'"(R d ) = H P (R d ) by Theorem |44 



We now consider the Beppo-Levi space of order m on R , i.e., 

BL m (R d ) := {/ g Re(Lf c (R d )) : D a f e L 2 (R d ) for all \a\ = m] 
equipped with the semi-inner product 



(/■^, S L„m-= X 4 f O a /(x)D*g(*)dx, f,geBL m (R d ). 



\a\=m 

According to HQ, we know that BL m (R d ) C Re(L'° c (R d )) n ST whenever m > d/2. Hence 
H P (R rf ) = BL m (R d ). 

By the way, it is well-known that G is also conditionally positive definite of order I := 
m - \d/2~\ + 1 (see 1211 Corollary 8.8]). However, the native space N G (R d ) induced by G and 
7Tj_i(R ) is a proper subspace of N' G (R d ) when d > 1. Therefore 

N l G (R d ) S Are(R rf ) s H P (R d ) = BL m (R d ), d > 1. 

Remark 5.1 If we have a vector distributional operator P := (Pi , ■ ■ ■ , P„) r whose distribu- 
tional Fourier transform satisfies x i-» ||p(jc)||2 € 7T2 m (R f ') and 

ja^D" : [a| = m, a e Nq} c {p ; - : y = 1, . . . ,nj , where a„ # and m > d/2, 

then Hp(R rf ) c BL m (R d ). According to the Sobolev inequality 1 1], there is a positive constant 
C such that 11/11^^ < C ||/)|^ „ i} for each / e H P (R rf ). This implies that this generalized 
Sobolev space Hp(R d ) also has the >S-dense property. 

Example 5.7 (Sobolev Splines, ^22] Example 3]) This is a generalization of Example 15.31 
LetP:=(Qj,--- ,Q^) r , where 

when j = 2k, 
zf V when j = 2k + 1, 




Q; := 1/2 t *eN , 7 = 0,1,. ...n, n>d/2. 



Here we use A := A We further define L := P* r P = (o- 2 / - zl) n . 

The Sobolev spline (or Matern function) is known to be the Green function with respect 
to L (see (2[ Chapter 6.1. 6] and (7] Chapter 13.2]), i.e., 

9 1 -n-d/2 

G(X) ■= ^E(n)o-^- d (^IWbr^-^IWb), xeR', 

where z i-» A" r (z) is the modified Bessel function of the second kind of order v. Since P and G 
satisfy the conditions of Theorem l4.2l and ||p(jc)|| 2 = 0(1) as \\x\\ 2 — » 0, G is positive definite 
and the associated interpolant s*ji is the same as the Sobolev spline (or Matern) interpolant. 
Combining | 22, Example 3] and Theorem l4.6l we can determine that 

N° G (R d ) = H P (R d ) = ?{ n (R d ). 

Moreover, this shows that the classical Sobolev space 7-("(R d ) becomes a reproducing-kernel 
Hilbert space with Hp(R rf )-inner product and its reproducing kernel is given by K(x,y) := 
G(x-y). 
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In the following example we are not able to establish that the operator P satisfies the 
conditions of Theorem 14.21 and so part of the connection to the theory developed in this 
paper is lost. We therefore use the symbol <P to denote the kernel instead of G. 

Example 5.8 (Gaussians, [22 Example 4]) The Gaussian kernel K(x,y) := <P(x-y) derived 
by the Gaussian function <P is very important and popular in the current research fields of 
scattered data approximation and machine learning. Therefore knowledge of the native space 
of the Gaussian function or the reproducing-kernel Hilbert space of the Gaussian kernel is 
of significant interest. In this example we will show that the native space of the Gaussian 
function is isometrically equivalent to a generalized Sobolev space. 
We firstly consider the Gaussian function 

<P(x) := exp(-o- 2 x e R , cr > 0. 

We know that <P is a positive definite function and its native space N^(V d ) is a reproducing 
kernel Hilbert space (see J7] Chapter 4]). 
Let P := (Qj, ■ • • , Qj, • • • f , where 

{(-nth*) 1 ' 2 A k whenrc = 2A:, 
Q„:= " !4tr 1/2 , fee N . 

A V when « = 2^+1, 

Here we again use A := I. Since the differential operators are just special cases of distri- 
butional operators, the generalized Sobolev space Hp(R d ) defined by P is the same as that 
derived in [22 Example 4]. Therefore we can combine Theorem l4.2ll4.6l and the techniques 
of the proof for [22 Example 4] to obtain that 

N° (R d ) = n r (R d ). 

Moreover, it is easy to verify that Hp(R.' / ) c "H"(R rf ) for each n e N. According to the 
Sobolev embedding theorem [1], we also have H P (R d ) c Re(C^°(R' / )). However, H P (R rf ) 
does not contain polynomials. If / e Re(C™(R << )) and there is a positive constant C such 
that HC/Hl^r,/) < C M for each a e N^, then / e H P (R rf ) which implies that / e Af° (R^). 

If we replace the test functions space to be 3$, then we can further think of the Gaussian 
function <t> is a (full-space) Green function of L := exp(--^A), i.e., L<t> = do and 0,5q e 

where S> = C~(R d ) and its dual space Si' are defined in C] Chapter 1.5]. 



6 Extensions and Future Works 

In this paper we have presented a unified theory for the generation of conditionally positive 
definite functions of order m as (full-space) Green functions with respect to a distributional 
operator L := P* r P with an appropriate vector distributional operator P. These even Green 
functions G e Re(C(R d )) n SI can be used as basic functions of a translation invariant 
meshfree kernel-based approximation method of the form (I2.1b -l l2~!2l l. Our analysis is limited 
to this translation invariant setting which does not address the fully general situation with 
kernels of the form K{x,y), but is more general than the radial setting. 

In Section [5] we were able to show that many different types of "splines" and radial 
basis functions can be treated with our Green function framework. Thus, reproducing ker- 
nel Hilbert space methods can be viewed as a natural generalization of univariate splines 
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(including such variations as tension splines). Other forms of univariate splines such as 
smoothing splines or regression splines can be covered using a related least squares frame- 
work, and multivariate generalizations of these methods are widely used in statistics and 
machine learning. 

We only consider real-valued functions as candidates for the generalized Sobolev spaces 
and Green functions in this paper, but all the conclusions and the theorems can be extended 
to complex- valued functions in a way similarly to [21|. Hp(R rf ) may not be complete even 
though we extend it to complex-valued functions. However, its completion is isometrically 
embedded into the tempered distribution space S' and has the explicit form 

H P (R rf ) = [T € S' : PjT e L 2 (R d ), j=l,...,n), if P = (P u ■ ■ ■ ,P„f . 

The vector distributional operator P can be further constructed by pseudo-differential 
operators. Therefore their generalized Sobolev spaces Hp(R rf ) are isometrically equivalent 
to the Beppo-Levi type spaces X"'(R d ). The paper [3| shows that the radial basis function 
under tension may be associated to a pseudo-differential operator in a Beppo-Levi space 
type. For example, if P := (co T f r b m /d^', ■ ■ ■ , (m\/a\) U2 « T TZr, ■ ■ ■ , cj T Tff" jdx^f , then 

H P (R d ) = X?(R d ) := [f e Re(L' 1 oc (R rf )) n SI : co T D«~f e L 2 (R rf ), \a\ =m,ae N d ) , 

where T is a distributional Fourier transform map and w T (x) := \\x\\l, < r < 1. However, 
P may not satisfy the condition of Theorem 14.21 We have reserved these situations for our 
future research. 

Unfortunately, it is sometimes difficult for us to solve a Green function matching the 
conditions of Theorem 14.2 l even if the vector distributional operator P satisfies the conditions 
of Theorem l4.2l However, there is usually an even Green function G e Re(C(R rf \ {0})) n SI. 
This means that the Green function merely has a singular point at the origin. According to 
our numerical tests of some cases, we find that this kind of Green function can still play 
the role of a basic function for the construction of a multivariate interpolant s f\ X via ( 12.11 1- 
d2.2b after some techniques to remove the singularity. One of the numerical tests is a two- 
dimensional example as below. Let P := (A, crV T ) T with cr > and the Green function with 
respect to L := P* r P = A 2 - a 2 A be given by 

G{x) := -^—2 (K Q (o- \\x\\ 2 ) + log (o- \\x\\ 2 )) , x e R 2 , 

where z <-> K v (z) is the modified Bessel function of the second kind of order v. We can use 
a transformation to remove the singularity of G as follows: 

G r (x) := -JLf (K (o- \\x\\ 2 + r) + log (cr ||x|| 2 + r)) , x e R 2 , r > 0. 

We guess that the interpolant via this modified Green function may be used to approximate 
functions belonging to the related generalized Sobolev space. 

We merely consider the Lebesgue measure here. However, we can further generalize our 
results to other measure spaces {Q, Sq,p), where Q c R d and Sq is the Borel set of Q. We 
suppose that the bijective map 

3K: (£>,S ,/i)-> (R'.Stf) 
is differentiable at every point of Q such that 



Ap(x) = |det (Jy\{x))\ dx, where Jy\{x) is the Jacobian matrix of 3{ at x. 
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According to the Radon-Nikodym Theorem [2 ] it is not difficult to gain similar conclusions 
when we transform the generalized Sobolev space to be 



Finally, we do not specify any boundary conditions for the (full-space) Green func- 
tions. Thus we may have many choices of the Green functions with respect to the same 
distributional operator L. In our future work we will apply a vector distributional operator 
P := (Pi, ■ ■ ■ , P„ p ) T and a vector boundary operator B := (Bi, ■ ■ ■ , B„ b ) T on a bounded do- 
main Q to construct a reproducing kernel and its related reproducing-kernel Hilbert space 
(see [ 8 1). We further hope to use the distributional operator L to approximate the eigenvalues 
and eigenfunctions of the kernel function with the goal of obtaining fast numerical meth- 
ods to solve the interpolating systems ( 12. 1 t - j2T2T > similar as fast multipole methods in 112 1 1 
Chapter 15]. 
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